arXiv:1507.04330v2 [cs.DC] 16Jul2015 


Optimal Dynamic Distributed MIS 

Keren Censor-Hillel* Elad Haramaty^^ Zohar Kamin^ 

July 17, 2015 


Abstract 

Finding a maximal independent set (MIS) in a graph is a cornerstone task in distributed computing. 
The local nature of an MIS allows for fast solutions in a static distributed setting, which are logarithmic 
in the number of nodes or in their degrees [Luby 1986, Ghaffari 2015]. By running a (static) distributed 
MIS algorithm after a topology change occurs, one can easily obtain a solution with the same complexity 
also for the dynamic distributed model, in which edges or nodes may be inserted or deleted. 

In this paper, we take a different approach which exploits locality to the extreme, and show how 
to update an MIS in a dynamic distributed setting, either synchronous or asynchronous, with only a 
single adjustment, meaning that a single node changes its output, and in a single round, in expectation. 
These strong guarantees hold for the complete fully dynamic setting: we handle all cases of insertions 
and deletions, of edges as well as nodes, gracefully and abruptly. This strongly separates the static and 
dynamic distributed models, as super-constant lower bounds exist for computing an MIS in the former. 

We prove that for any deterministic algorithm, there is a topology change that requires n adjustments, 
thus we also strongly separate deterministic and randomized solutions. 

Our results are obtained by a novel analysis of the surprisingly simple solution of carefully simulating 
the greedy sequential MIS algorithm with a random ordering of the nodes. As such, our algorithm 
has a direct application as a 3-approximation algorithm for correlation clustering. This adds to the 
important toolbox of distributed graph decompositions, which are widely used as crucial building blocks 
in distributed computing. 

Finally, our algorithm enjoys a useful history-independence property, which means that the distri¬ 
bution of the output structure depends only on the current graph, and does not depend on the history 
of topology changes that constructed that graph. This means that the output cannot be chosen, or even 
biased, by the adversary, in case its goal is to prevent us from optimizing some objective function. More¬ 
over, history independent algorithms compose nicely, which allows us to obtain history independent 
coloring and matching algorithms, using standard reductions. 
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1 Introduction 


Dynamic environments are very common in distributed settings, where nodes may occasionally join and 
leave the network, and communication links may fail and be restored. This makes solving tasks in a dynamic 
distributed setting of fundamental interest: Indeed, it is a widely studied area of research, especially in the 
context of asynchronous self-stabilization Ill71ll81l26ll50l . and also in the context of severe graph changes, 
whether arbitrary or evolving randomly Q. In this paper, we consider a dynamic distributed setting 
which is synchronous and assumes topology changes with sufficient time for recovery in between, as is 
typically assumed in the literature on sequential dynamic algorithms lITSl . 

Solutions for problems from the static distributed setting translate nicely into our dynamic distributed 
setting, by running them in response to topology changes, in order to adjust the output ll5ll6llTO . This can be 
quite efficient especially for local problems, such as finding a maximal independenf sef (MIS) in fhe network 
graph. The cornerstone MIS problem admits fast distributed solutions whose complexities are logarithmic 
in the size of the graph or in the degrees of the nodes Il2l l25l[3ni43l . 

In this paper, we exploit locality to the extreme, and present an MIS algorithm for the dynamic dis¬ 
tributed setting, both synchronous or asynchronous, which requires only a single adjustment, where the 
adjustment measure of an algorithm is the number of nodes that need to change their output in response to 
the topology change, and a single round, in expectation. These strong guarantees hold for the complete fully 
dynamic setting, i.e., we handle all cases of insertions and deletions, of edges as well as nodes, gracefully and 
abruptly^ This is a strong separation between the static and dynamic distributed models, as super-constant 
lower bounds exist for the static setting II371I42L We further prove that for any deterministic algorithm, there 
is a topology change that requires n adjustments, with n being the number of nodes, thus we also strongly 
separate deterministic and randomized solutions. Below, we overview our technique and the applications of 
our result. 

1.1 Our Contribution 

Our approach is surprisingly simple: We simulate the greedy sequential algorithm for solving MIS. The 
greedy sequential algorithm orders the nodes and then inspects them by increasing order. A node is added 
to the MIS if and only if it does not have a lower-order neighbor already in the MIS. We consider ran¬ 
dom greedy, the variant of greedy in which the order is chosen uniformly at random. Consider simulating 
random greedy in a dynamic environment with the following template (ignoring the model of computa¬ 
tion/communication for the moment). Each node needs to maintain the invariant that its state depends only 
on the states of its neighbors with lower order, such that it is in the MIS if and only if none of its lower 
order neighbors are in the MIS. When a change occurs in the graph, nodes may need to change their output, 
perhaps more than once, until they form a new MIS. Our key technical contribution is in proving: 

Theorem [1] For any arbitrary change in the graph, the expectation over all random orders, of the number 
of nodes that change their output in the above random greedy template is at most 1. 

The Challenge: We denote by vr the random order of nodes, we denote by v* the only node (if any) 
for which the above invariant does not hold after the topology change, and we denote by S the set of nodes 
that need to be changed in order for the invariant to hold again at all nodes. We look at S', the set of nodes 
that would have needed to be changed if the order was as in vr, except for pushing v* to be the first node 
in that order. The definition of S' does not depend on the real order of v* in vr. Therefore, we can prove 
that S can either be equal to S' if the order of v* in tt is minimal in S', and empty otherwise. Now the 

*See definitions of topology changes in Sectionj^ 
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question is, what is the probability, given S', that v* is indeed its minimal order node? The answer is that if 
S' were deterministic, i.e. independent of vr, the probability would be l/IS"!. However, S' is a random set 
and having knowledge of its members restricts vr to be non-uniform, which in turn requires a careful analysis 
of the required probability. To overcome this issue, we prove that the information that S' gives about vr is 
either about the order between nodes not in S', or about the order within which both do not effect 

the probability that v* is the minimal in S'. 

Distributed Implementation: This powerful statement of IE[|5|] < 1 directly implies that a single 
adjustment is sufficient for maintaining an MIS in a dynamic setting. A direct distributed implementation of 
our template implies that in expectation also a single round is sufficient. This applies both to the synchronous 
and asynchronous models, where the number of rounds in the asynchronous model is defined as fhe longesf 
pafh of communication. 

Obtaining 0(1) Broadcasts and Bits: In fact, in the synchronous model, it is possible to obtain an 
expected number of 0(1) broadcasts and bits. Here the number of broadcasts is the total number of times, 
over all nodes, that any node sends a O (log n)-bit broadcast message (to all of its neighborsjl. Moreover, 
since we only need a node to know the order between itself and its neighbors, using a similar technique to 
that of 1431 . we can obtain that in expectation, a node only needs to send a constant number of bits in each 
broadcast. The above holds for edge insertions and deletions, graceful node deletion, and node unmuting, 
while for an abrupt deletion of a node v* we will need O (min{log(n), (i(r;*)}) broadcasts, and for an 
insertion of a node v* we will need 0{d{v*)) broadcasts, in expectation. 

This is done with a careful dynamic distributed implementation which guarantees that each node that 
changes its output does so at most 0(1) times, as opposed to the direct distributed implementation H. Hence, 
obtaining these broadcast and bit complexities comes at a cost of increasing the round complexity, but it 
remains constant (albeit not 1). In what follow, we focus on this result since we find it intriguing that we can 
get as little as 0(1) total communication, while paying 0(1) rounds instead of a single round is arguably 
not a big cost. 

Matching Lower Bounds: We claim that any deterministic algorithm requires n adjustments, which 
can be seen through the following example. Let A be a dynamic deterministic MIS algorithm. Let Go be 
the complete bipartite graph over two sets of nodes of size k. We denote by L the side of Go that is chosen 
to be the MIS by A, and we denote the other side by R. For every i G [k] let Gj be the graph obtained after 
deleting i nodes from L, and consider executing A on Go, Gi,..., G^. For every i, since Gj is a complete 
bipartite graph, one of the sides has to be the MIS. Since Gk contains only disconnected nodes of R then R 
is the only MIS of G^. This implies that after some specific change along the sequence, the side of the MIS 
changes from L to R. In this topology change, all of the nodes change their output. 

This gives a strong separation between our result and deterministic algorithms. Moreover, it shows that 
(1) the expected adjustment complexity of any algorithm must be at least 1, as we have a sequence of k 
topology changes that lead to at least k adjustments, and (2) it is impossible to achieve high probability 
bounds that improve upon a simple Markov bound. Specifically, this explains why we obtain our result 
in expectation, rather than with high probability. This is because the example can be inserted into any 
larger graph on n nodes, showing that/or every value of k, there exists an instance for which at least Q,{k) 
adjustments are needed with probability at least 1/k. 

Approximate Correlation Clustering: In addition to the optimal complexity guarantees, the fact that 
our algorithm simulates the random greedy sequential algorithm has a significant application to correlation 

^We emphasize that the term broadcast is used here to indicate the more restricted setting of not being able to send different 
messages to different neighbors in the same round. It does not refer to a wireless setting of communication. 

^This bears some similarity to the method in 1531 . where the number of moves is reduced in an MIS self-stabilizing algorithm 
by adding a possible wait state to the standard in MIS and not in MIS states. 
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clustering. Correlation clustering requires the nodes to be partitioned into clusters in a way that minimizes 
the sum of the number of edges outside clusters and the number of non neighboring pairs of nodes within 
clusters (that is, missing edges within clusters). Ailon et al. show that random greedy obtains a 3- 
approximation for correlation clustering by having each MIS node inducing a cluster, and each node not 
in the MIS belonging to the cluster induced by the smallest random ID among its MIS neighbors. This 
directly translates to our model, by having the nodes know that random ID of their neighbors. Graph 
decompositions play a vital role in distributed computing (see, e.g., I(47l ). and hence the importance of 
obtaining a 3-approximation for correlation clustering. 

History Independence; Finally, our algorithm has a useful property, which we call history indepen¬ 
dence, which means that the structure output by the algorithm (e.g., the MIS) depends only on the current 
graph, and does not depend on the history of topology changes. This means that the output cannot be chosen, 
or even biased, by the adversary, in case its goal is to prevent us from optimizing some objective function. 
Moreover, history independent algorithms compose nicely, which allows us to obtain history independent 
coloring and matching algorithms, using standard reductions. 

1.2 Related Work 

Distributed MIS: Finding an MIS is a central theme in the classical distributed setting. The classic algo¬ 
rithms ||2j[3Tl|43l complete within O(logn) rounds, with high probability. More recently, a beautiful line 
of work reduced the round complexity to depend on A, the maximal degree in the graph. These include 
the 0(A -|- log* n)-round algorithm of fg], the 0(log AyToglrj-round algorithm of ifTOl . and the very re¬ 
cent 0(log A) -|- 2 '^(v'toglog n )-round algorithm of ll25ll . An excellent source for additional background and 
relations to coloring can be found in [8]. 

Distributed dynamic MIS: The problem of finding a fast dynamic distributed MIS algorithm appears 
as an open problem in 1211 . which studies the problem of maintaining a sparse spanner in this setting. 
Additional problems in this setting are also addressed in ifTTI . and in slightly different settings in ll4l [T4l[32l 
l35l . However, we are unaware of any other work in this setting about maintaining an MIS. One standard 
approach for maintaining an MIS is running distributed algorithms that are designed for the static setting. 
This can be done for any distributed algorithm, sometimes using a corresponding compiler, e.g., when 
applied to an asynchronous dynamic setting |l5l|6l|40l. One important exception is the solution in 1341 . 
which as in our algorithm, requires a constant number of rounds, but as opposed to our algorithm, makes the 
strong assumptions that (1) a node gracefully departs the network, and (2) messages may have unbounded 
size. An additional difference is that the number of broadcasts, as opposed to the number of rounds, may be 
large. 

Additional distributed dynamic algorithms: A huge amount of literature is devoted to devising dif¬ 
ferent algorithms in a self-stabilizing setting (see, e.g., l[T^I26ll50l and references therein). This setting 
is inherently different from ours since it measures the time it takes an algorithm to reach a correct output 
starting from any arbitrary configuration. The setting is asynchronous, but considers a notion of time that 
is different than ours, where an asynchronous round requires that each node communicates with all of its 
neighbors. This inherently implies a lot of communication (broadcasts). 

An MIS-based clustering algorithm for the asynchronous model that appeared in ll20l also uses a random 
node order for recovering after a change. However, their self-stabilizing setting differs from ours in several 
aspects, such as assuming a bounded degree graph and discussing corrupted states of multiple nodes, and 
multiple topology changes. In addition, our techniques and analysis are completely different. In particu- 

"'in the same paper, they also provide a 2.5 approximation based on rounding a solution of a linear program. We do not elaborate 
on the details of this algorithm, nor the history of the correlation clustering problem as it is outside the scope of our paper. 
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lar, the clustering obtained there may not be an approximation to correlation clustering. Furthermore, the 
number of rounds required by 1201 is 0(log(n)) as opposed to the single round algorithm (in expectation) 
presented here. 

Related, but not identical, notions of error confinement, fault local and fault amendable algorithms have 
been studied in |l7l|38l|39l, where the internal memory of a node may change. Another property that self- 
stabilizing algorithms should aim for is super-stabilization (191, which means that they are self-stabilizing 
(eventually output the required structure) and also recover extremely fast from a single topology change. 
Super-stabilization requires also a small adjustment measure, which is the maximum number of nodes that 
have to change their output. Our MIS algorithm recovers from a single topology change in a single round, 
and has an adjustment measure of exactly 1, in expectation. 

Simulating the sequential greedy algorithm: Simulating random greedy has been used before in order 
to obtain fast solutions for sequential local computation algorithms (LCA). In this setting, the algorithm does 
not have access to the entire graph, but rather an oracle access using queries about nodes or edges, and needs 
to provide an approximate solution for various problems, among which are the problems considered in this 
paper. We emphasize that the models are inherently different, and hence is our technical analysis. While we 
bound the size of the set of nodes that may change their output after a topology change, studies in the local 
computation literature 1141114411^l54l bound the size of the set of nodes that need to be recursively queried 
in order to answer a random node query. In some sense, these sets are opposite: We begin with a single node 
V changing its state due to a topology change, and look at the set of nodes that change their state due to the 
change of v. Local computation algorithms begin with a single node v and look at the set of nodes whose 
states determine the state of v. 

2 Dynamic Distributed Computations 

The distributed setup is a standard message passing model. The network is modeled by an undirected 
graph G = {V, E) where the node set is V, and E consists of the node pairs that have the ability to directly 
communicate. We assume a broadcast setting where a message sent by a node is heard by all of its neighbors. 
Also, we assume a synchronous communication model, where time is divided into rounds and in each round 
any willing node can broadcast a message to its neighbors. We restrict the size of each message to be 
0(log(n)) bits, with n = \V\ being the size of the networlj^ The computational task is to maintain a graph 
structure, such as a maximal independent set (MIS) or a node clustering. That is, each node has an output, 
such that the set of outputs defines the required structure. 

Our focus is on a dynamic network, where the graph changes over time. As a result, nodes may need 
to communicate in order to adjust their outputs. The system is stable is when the structure defined by fhe 
oufpufs satisfies fhe problem requiremenfs. 

A graph topology change can be wifh respecf to eifher an edge or a node. In bofh cases we address bofh 
delefions and insertions, bofh of which are furlher splif into fwo differenl fypes. For deletions we discuss bofh 
a graceful deletion and an abrupt deletion. In fhe former, fhe delefed node (edge) may be used for passing 
messages befween ifs neighbors (endpoinfs), and retires complefely only once fhe system is sfable again. In 
fhe latter, fhe neighbors of fhe delefed node simply discover fhaf fhe node (edge) has refired buf if cannof be 
used for communicafion. For insertions, we distinguish befween a new node insertion and an unmuting of a 
previously exisfing node. In fhe former, a new node is inserted fo fhe graph, possibly wifh mulfiple edges. 
In fhe latter, a node fhaf was previously invisible to ifs neighbors buf heard fheir communication, becomes 

^This is the standard assumption in a distributed setting. In our dynamic setting where the size of the graph may change we 
assume knowledge of some upper bound N > n, with N = a^d restrict the message length to 0(log(A)) = 0(log(n)). 
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visible and enters the graph topolog)!^ 

We assume that the changes are infrequent so that they occur in large enough time gaps, so that the 
system is always stable before a change occurs. We consider the performance of an algorithm according 
to three complexity measures. The first is the adjustment-complexity, measuring the number of nodes that 
change their output as a result of the recent topology change. The second is the round-complexity, which is 
the number of rounds required for the system to become stable. Finally, the third, more harsh, score is the 
broadcast-complexity, measuring the total number of broadcasts. 

Our algorithms are randomized and thus our results apply to the expected values of the above measures, 
where the expectation is taken over the randomness of the nodes. We emphasize that this is the only ran¬ 
domness discussed; specifically, the result is not for a random node in the graph nor a random sequence 
of changes, but rather applies to any node and any sequence of changes. It holds for every change in the 
graph, not only in amortized over all changes. For this, we make the standard assumption of an oblivious 
non-adaptive adversary. This means that the topology changes do not depend on the randomness of the 
algorithm. This standard assumption in dynamic settings is natural also for our setting, as, for example, an 
adaptive adversary can always choose to delete MIS nodes and thereby force worst-case behavior in terms 
of the number of adjustments. 

In what follows we discuss the problem of computing an MIS. Here, the outputs of the nodes define a set 
M, where any two nodes in M are not connected by an edge, and any node not in M has a neighbor in M. 
The second problem we discuss is that of correlation clustering. Here, the objective is to find a partitioning 
C of the node set V, where we favor partitions with a small number of “contradicting edges". That is, we 
aim to minimize the sum J^cec Zu,vec ^[(u,v)^b] + Eci^C 2 eC EneCuveC 2 ^[MeE]- 

3 A Template for Maintaining a Maximal Independent Set 

In this section we describe a template for maintaining a maximal independent set (MIS). Initially, we are 
given a graph G = {V, E) along with an MIS that satisfies certain properties, and after a topology change 
occurs in the graph, applying the template results in an MIS that satisfies the same properties. That is, the 
template describes what we do after a single topology change, and if one considers a long-lived process of 
topology changes, then this would correspond to having initially an empty graph and maintaining an MIS 
as it evolves. We emphasize that the template describes a process that is not in any particular model of 
computation, and later in Section |4] we show how to implement it efficiently in our dynamic distributed 
setting. This also means that there are only four topology changes we need to consider: edge-insertion, 
edge-deletion, node-insertion and node-deletion. For example, the notions of abrupt and graceful node 
deletions are defined with respect to the dynamic distributed setting because they affect communication, and 
therefore the implementation of the template will have to address this distinction, but the template itself is 
only concerned with a single type of node deletion, not in any particular computation model. 

Throughout, we assume a uniformly random permutation tt on the nodes v ^ V. We define two states 
in which each node can be: M for an MIS node, and M for a non-MIS node. We abuse notations and 
also denote by M and M the sets of all MIS and non-MIS nodes, respectively. Our goal is to maintain the 
following MIS invariant: A node u is in M if and only if all of its neighbors u ^ N{v) which are ordered 
before it according to vr, i.e., for which 7r{u) < tt{v), are not in M. It is easy to verify that whenever the 
MIS invariant is satisfied, it holds that the set M is a maximal independent set in G. Furthermore, it is easy 
to verify that this invariant simulates the greedy sequential algorithm, as defined in the introduction. 

When any of the four topology changes occurs, there is at most a single node for which the MIS invariant 

®The distinction is only relevant for nodes insertions, as there is no knowledge associated with an edge. 
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no longer holds. We denote this node by v* = v*{Goid,Gaew,T^), where Gold and Gnew are the graphs 
before and after the topology change. For an edge insertion or deletion, v* is the endpoint with the larger 
order according to vr. For a node insertion or deletion, v* is the node. 0 In case the topology change is 
an edge change, we will need also to take into consideration its other endpoint. We denote it by v** = 

(Gold,Gnew,Ti"). and notice that by our notation, it must be the case that 7 r(r;**) < 7t{v*). In order 
to unify our proofs for all of the four possible topology changes, we talk about a node v** also for node 
changes. In this case we define v** to be v* itself, and we have that = 7r{v*). Therefore, for any 

topology change, it holds that 7r(r;**) < 7r(r;*). 

To describe our template, consider the case where a new edge is inserted and it connects two nodes 
7 r(r;**) < 7 r(r;*), where both nodes are in M. As a result, v* must now be deleted from the MIS and hence 
we need to change its state. Notice that as a result of the change in the state of v*, additional nodes may 
need their state to be changed, causing multiple state changes in the graph. An important observation is that 
it is possible that during this process of propagating local corrections of the MIS invariant, we change the 
state of a node more than once. As a simple example, consider the case in which v* has two neighbors, ui 
and U 2 , for which tt{v*) < 7r(tti), 7 r(u 2 ), and that ui and U 2 are connected by a path {ui,wi,W 2 ,U 2 ), with 
7 r(tti) < 7 r(mi) < tt{w 2 ) < vr(tt 2 ). Now, when we change the state of v* to M, both ui and U 2 need to be 
changed to M, for the MIS invariant to hold. This implies that wi needs to be changed to M and W 2 needs 
to be changed to M. In this case, since 7r{w2) < vr(ti 2 ), the node U 2 needs to be changed back to state M. 

The above observation leads us to define a sef of influenced nodes, denofed by S' = S(Goid, Gnew, 
confaining v* in fhe scenario where we need fo change ifs sfafe, and all ofher nodes whose sfafe we musf 
subsequenfly change as a result of the state change of v*. To formally define fhe sef S we infroduce some 
nofafions. The nofafions rely on fhe graph sfrucfure of Gnew unless fhe change is a node deletion in which 
case fhe rely on Gom- For each node u, we define = {f G A^(rt) | ^{v) < 7 r(u)}, fhe sef of neighbors 
of u fhaf are ordered before if according fo tt. These are fhe nodes fhaf can pofenfially influence fhe sfafe of 
u according fo fhe MIS invarianf. The definition of S is recursive, according to the ordering induced by vr. If 
immediately after the topology change, in the new graph G with the order vr it holds that the MIS invariant 
still holds for v*, then we define S = 0. (This is mofivated by fhe facf fhaf no node is influenced by Ibis 
change.) Ofherwise, we denote Sq = {n*}, and inductively define 

Si = {u\u ^ M, and 5j_i n I-iriu) / 0} U {u | tt G M, and every v G I-k{u) n M is in U*“q Sj)}. (1) 

The sef S is fhen defined as S' = IJ^ S*. Nofice fhaf a node u can be in more fhan one sef Si, as is fhe 
case for U 2 in fhe example above, which is in bofh Si and S 4 . The impacf of a node u being in more fhan one 
Si is fhaf in order fo mainfain fhe MIS invarianf, we need fo make sure fhaf we update fhe sfafe of u affer we 
update fhaf of w, for any w such fhaf w G I-k{u). Instead of updating the state of u twice, we can simply wait 
and update it only after the state of every such w is updated. For this, we denote by iu = max{t | u G S^} 
the maximal index i for which u is in Si. 

We formally describe our template in Algorithm [T] By construction, the updated states after executing 
Algorithm [T] satisfy the MIS invariant. In addition, the crucial property that is satisfied by fhe above template 
is fhaf in expecfafion, fhe size of fhe sef S' is 1. The remainder of fhis secfion is devofed fo proving fhe 
following, which is our main technical result. 

Theorem 1. For every two graphs Gom tind Gnew that differ only by a single edge or a single node, it holds 
that Ejr [|S(Goid , Gnew, ^) |] — !• 

’For a node deletion, we slightly abuse the definition of v* in order to facilitate the presentation, and consider it to be the deleted 
node. This means that here we consider an intermediate stage of having v* still belong to the graph w.r.t. the MIS invariant of all 
the other nodes, but for v* the MIS invariant no longer holds. This is in order to unify the four cases, otherwise we would have to 
consider all of the neighbors of a deleted node as nodes for which the MIS invariant no longer holds after the topology change. 
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Algorithm 1 A Template for Dynamic Correlation Clustering. 
Initially, G = {V, E) satisfies the MIS invariant. 

On topology change at node v* do: 

1. Update state of v* if required for MIS to hold 

2. For i •(— 1, until Si = 0, do: 

3. For every u £ Si such that i = iu- 

4. Update state of u 

5. f •(— z + 1 


Outline of the proof: In order to prove that E[|S'|] < 1, instead of analyzing the set S directly, we analyze 
the set S' = S' {Gold, G new, TT, n*), which is defined via recursion similarly to S with three modifications: 
(1) It is always the case that S'q = {n*} (2) The graph according to which S' is defined is Gom in the case of 
a node deletion or an edge insertion, and Gnew otherwise. (3) The permutation according to S' is defined as 
vr', that is identical to vr other than its value for v* that is forced to be the minimal among all other vr values. 
Notice that S' does not depend on 'k{v*) and in particular, having knowledge about its elements does not 
give any information as to whether 7r(n*) < 'k{v**) or vice versa. 

In LemmalU we prove that if 7r(r;*) / min {'k{u) \ u £ 5'} then 5 = 0, and otherwise S = S' (in fact, 
it would be enough that 5 C S'). Then, in Lemma |3 we prove that for any set P <£ V, given the event 
that P = S', the probability, over the random choice of vr, that tt{v*) = min{7r(u) | u £ P} is I/IF*!. This 
leads to the required result of Theorem [T] Lemma |3] would be trivial if there was no correlation between 
vr and S'. However, the trap we must avoid here is that S' is defined according to vr, and therefore when 
analyzing its size we cannot treat vr as a uniformly random permutation. To see why, suppose we know that 
inside S' \ {n*} we have nodes with large order in tt. Then the probability that the order of v* in vr is smaller 
than all nodes in 5'\ {n*}, is much larger than 1/|5'|, and can in fact be as large as 1 — o(l). In other words, 
S' gives some information over vr. Nevertheless, we show that this information is either about the order 
between nodes outside of S', or about the order between nodes within 5' \ {n*}. Both types of restrictions 
on TT do not affect the probability that v* is the minimal of S'. 

We now formally prove our result as outlined above. Throughout we use the notation u £ M oxu £ M. 
This applies only to nodes u for which we are guaranteed that their states remain the same despite the 
topology change. 

Lemma 2. If tt{v*) min{ 7 r(tt) | u £ 5'} then 5 = 0. Otherwise, 5 C S'. 

Proof. First, assume that 7 r(n*) / min {vr(tt) | u £ 5'}. We show that the MIS invariant still holds after the 
topology change, and so 5 = 0. Consider the node w, for which tt{w) = min{ 7 r(tt) | u £ 5'}. Notice that 
w ^ S, because 7r{w) < tt{v*). We claim that w £ M. Assume, towards a contradiction, that w £ M. This 
implies that w has a neighbor u £ M such that 7 r(rt) < 7 r(m). For this node u we must have u ^ S' due to 
the minimality of tt{w). It follows, according to the construction of S' that w cannot be an element of S', 
leading to a contradiction. 

We have that w £ M and due to the minimality of 7r{w), it must be that w £ 5^, which implies that w 
is a neighbor of v*. But then, when considering 5, v* has a neighbor other than v** which is ordered before 
it according to tt which is in M. In the case of an edge insertion or deletion, this means that v* remains in 
M despite the topology change meaning that 5 = 0. In the case of a node deletion, v* was not in M prior 
to the change hence 5 = 0. In the case of a node insertion, v* does not enter M hence again, 5 = 0. 

Next, assume that tt{v*) = min{ 7 r(u) | u £ 5'}. We show that either 5 = 0 or 5 = 5'. If there is 
no need to change the state of v* as a result of the topological change then 5o = 0, and so 5 = 0 and the 
claim holds. It remains to analyze the case where 5o = 5 q = {n*}. If tt G 5^ then 7 r(t;*) < 7 r(n) hence 
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according to its definition u G Si. If u ^ S[ then u must have a neighbor w £ M with tt{w) < 7r{u) 
meaning that u ^ Si. We have that 5i = 5^ and similarly Si = 5- for alH > 1. We conclude that S' = S 
as required. □ 


The following lemma shows that the probability of having S = S' is 1/|5'|, which immediately lead to 
Theorem [T] as the only other alternative is 5 = 0. 

Lemma 3. For any set of nodes P fV, it holds that 

Pr [vr(r;*) = min{7r(M) | u € P} | 5^ = P and ' k { v **) < = |^. 

To prove this lemma we focus on S'. Notice that the events we considered in the previous lemma 
depend only on the ordering implied by vr and hold for any configuration of states for the nodes that satisfy 
the MIS invariant. Roughly speaking, the lemma will follow from the fact the the event S' = P does not 
give any information about the order implied by vr between nodes in P and nodes in V\P. To this end, for 
every permutation r on V, we define S'{t) = S'{Go\d, Gaew^T, v*) as the set corresponding to S' under the 
ordering induced by r. We denote by lip the set of all permutations r for which it holds that S' (r) = P. 
We first need to establish the following about permutations in Tip: If vr and a are two permutations on V 
such that ttIp = a\p and 7r|y\p = a\v\p, then a G lip if and only if vr G lip. 

Claim 4. Let P f V be a set of nodes, and let vr and a be two permutations such that tt\p = fT|p and 
7r|y\p = (T|y\p. Assume TT G lip. Wc have that V\P C 1/ \ S'{a) and every u G V\P has the same state 
according to vr and a. 

Proof. Let u G V\P. We prove that u £ V \ S'{a) and that its state under a is the same as it is under vr by 
induction on the order of nodes in V\P according to vr (which is equal to their order according to a). 

For the base case, assume that u has the minimal order in V\P. We claim that u cannot have a neighbor 
in P. Assume, towards a contradiction, that u has a neighbor w £ P. Since w £ P then it is possible that 
after the a w will be in M. Since two nodes in M cannot be neighbors and u ^ P, then u must be in M 
according to vr. In this case there is a node 2 G Iniu) n V\P that is in M according to vr. But this cannot 
occur due to the minimality of 7r{u) in V\P. Therefore, u has no neighbors in P as required. 

We have that all of the neighbors of u are in V\P and that u is the minimal among its neighbors 
according to vr. Since vr|y\p = (T|yyp we have that u has the minimal order among its neighbors according 
to a. This translates into u having a state of M under a and in particular, u is not an element of S'(a), thus 
proving our base case. 

For the induction step, consider a node u £ V\P, and assume the claim holds for every w £ V\P H 
Itt{u). We consider two cases, depending on whether u has a neighbor in P or not. 

Case 1: u does not have any neighbor in P. \fu £ M, then there is a node z G H V\P that is 

in M according to vr. By the induction hypothesis, z G P \ S'{a) and z £ M also according to a. Since 
7r|y\P = (T|y\p, we have that u is in M according to a too. Otherwise, if ti G M, then every w £ 

(which is also V\P) is in M according to vr. Any node w £ Ia{u) is also inw £ In{u), since it is not in P 
and 7r|yyp = (T|yyp. The induction hypothesis on w gives that it is also in P \ S'{a) (otherwise it would be 
in = P in contradiction to the assumption of case 1), and its state according to a is M. Hence, u must 
be in P \ S'{a) as well, and in state M according to a. 

Case 2: Assume that u has a neighbor w £ P. Since w £ P then it is possible that after the algorithm 
w will be in M. Since two nodes in M cannot be neighbors and u ^ P, then u must be in M according 
to vr. In this case there is a node z G lTr{u)\P that is in M according to vr. By the induction hypothesis, 
z G P \ S'{a) and z £ M also according to a. Since Tr\v\p = f^ly^p, we have that n is in M and in 
P \ S'{o) according to a too. □ 


Claim 5. Let P <Z V be a set of nodes, and let vr and a be two permutations such that 7 r|p = a\p and 
7 r|y\p = cr\y\p. Assume vr € lip. We have that P C S'{a). 

Proof. We prove that every node u G P is also in S' (cr) by induction on the order of nodes in P according 
to vr (which is equal to their order according to a), with the modification forcing v* to be the first among the 
nodes of P. The base case is for v*, which is clearly in both sets S'{'k) and S'(a). Consider a node u ^ P 
and assume that the claim holds for every node in P which is ordered before u according to tt. Since u £ P 
and u V* there must be some w G It^{u) r\ P since 7r|p = a\p and u £ P v/e. have according to our 
induction hypothesis that w £ S'{o-), meaning that Ia{u) n S'{a) is non-empty. 

Consider now an arbitrary w £ Ia{u). If w £ P then since 7r|p = a\p and u £ P v/e have according 
to our induction hypothesis that w £ S'{a). If w ^ P then it must be the case that w £ M according to vr, 
otherwise u cannot be in P. We thus have according to Claim |4] that (1) m G C \ S'(a) and (2) w £ M 
according to a. It follows that all neighbors of u in Ia{u) are either in S'{a) or in M according to a, hence 
since laiu) n S'(a) / 0 it must be the case that u £ S'(a). □ 

Claims m and [5] combined imply that if 7 r|p = a\p and 7 r|yyp = <t\v\p then a £ lip if and only if 
TT G lip. We are now ready for the proof of Lemma |3l 

Proof (of Lemma Given two permutations and a~ on P\{n*} and V\P, respectively, we define 
as Pa+,a- = Pr [Vti G P,vr(n*) < 7 r(tt) | 7 r|p\{.„*} = and 7r\v\p = cr~] ■ 

First, we observe that for two pairs of permutations af , af and crj, af as above, it holds that p^+ = 

p^+ This is because given the condition for af,af, applying the permutation to nodes 

in P\{n*} and applying the permutation to nodes in V\P has no affect on whether the event 

Vrt G P,7r{v*) < 7 r(u) holds. Next, since Pr [Vu G P,Tr{v*) < 7 r(u)] = we have that for any pair of 
permutations ,a~ on P\{n*} and P\P, respectively: 

— = Pr [Vu G P, 7 r(P) < 7 r(u)] = ^ Pr [ 7 r|p\{^*} = r+ and tt\v\p = r"] 

T+,T~ 

= [^lp\K} = and 7r|y\p = t ~] = p„+^„-. 

Finally, Claims |4] and |5] imply that for every set P C 1/ there is a set of f = tp pairs of permuta¬ 
tions erf),..., (fj^, erf)} on P\{u*} and V\P, respectively, such that lip = {tt | 3 f, 7 r|py|^*} = 

af and 7 r|yyp = o'~}. We conclude that fop a given set P C I/: 

Pr [Vu G P, 7 r(u*) < 7 r(u)] = V p^+ Pr [ 7 r|p\{^*} = f 7 + and 7 r|y\p = er" | vr G Up] 

TrEilp ‘ ^ 

t ^=1 

1 , \ 1 
= M ^ and 7 r|y\p = erf | vr G Hp] = —. (2) 

' ' 2=1 ' ' 

To complete the proof, we argue that knowing that 7r(u**) < 7r(u*) can only decrease the probability that 
7r(u*) < 7r(u) for all u G P. Formally, 

Pr [Vu G P, 7 r(u*) < 7 r(u) | Tr{v**) < ' 7 r(u*)] 

neUp 

= Pr [Vu G P,tt(v*) < 7 r(u) and Tr(v**) < 7r{u) \ Tr{v**) < vr(u*)] 

TTgllp 

Pr^rgHp [Vu G P,7r{v*) < 7 r(u) and Tr{v**) < 7 r(u) and 7 r(u**) < vr(u*)] 

Pr^gnp [vr(u”) < 7r(p)] 

^ PiVgnp [Vu G P,7r{v*) < 7 r(u) and 7 r(u**) < 7 r(u)] 

Pr^gHp [7r(u**) < 7r(u*)] 
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To bound the above expression, we separate our discussion into three possible cases. In the first v** ^ v* and 
V** G P. The value of the expression is clearly 0 in this case. In the second we have v* = v** and according 
to Equation ([Hi we have that the quantity is bounded by 1/|P|. The last case is the one where v** ^ P. 
Here, because v* £ P and v** ^ P we have that the events of 7r(u*) being the minimal in {7^(^i)}^Jgp and 
Tr{v**) being the smaller than each of the elements of {7r{u)}u£P are independent for uniform vr G Tip. 
This is due to the first event being dependent of the inner order inside P and the second being independent 
of the same inside order. Hence, 


Pr^renp [Vu G P,tt{v*) < Tr{u) and 7r{v**) < 7 r(ri)] 

Pr^gnp [vr(n**) < 7r(P)] 

Pr^renp [Vu G P,7r{v**) < 7 r(u)] 


Pr^gHp [iriv**) < Tr{v*)] 


Pr [Vu G P,Tr{v*) < 7 r(u)] 

ttGIIp 


< Pr [Vu G P, 7 r(u*) < 7 r(tt)] < 1/|P| 

Treltp 

□ 

Lemma [2] and Lemma[3]immediately lead to Theorem[T] Also, as an immediate corollary of Theorem[I] 
we get 


Corollary 6 . A direct distributed implementation of Algorithm\2\has, in expectation, both a single adjust¬ 
ment and round, in both the synchronous and asynchronous models. 


4 A Constant Broadcast Implementation 

Theorem [U promises that the expected number of nodes that need to change their output according to our 
template algorithm is 1. However, a direct implementation of the template in Algorithm [T] in a dynamic 
distributed setting may require a much larger broadcast complexity because it may be the case that a node 
needs to change its state several times until the MIS invariant holds at all nodes. This is because a node can 
be in more than a single set Si, as discussed in the previous section. In such a case, despite the fact that the 
expected number of nodes in S' is a constant, it may be that the expected number of state changes is much 
larger. Specifically, in a naive implementafion, the number of broadcasts may be as large as IS'p. Hence, 
although E[|S'|] = 1, the expected number of broadcasts may be as large as n. 

We thus take a different approach for implementing the template in Algorithm [T] in the synchronous 
setting, where each node waits until it knows the maximal i for which it belongs to Si, and changes it 
state only once. This allows to obtain, for almost all of the possible topology changes a constant broadcast 
complexity at the cost of a constant, rather than single, round complexity. 

In order to implement the random permutation vr we assume each node u G P has a uniformly random 
and independent ID G [0,1]. We will maintain the property that each node has knowledge of its i value 
and those of its neighbors. We describe our algorithm in Algorithm |2l This directly applies to the following 
topology changes: edge-insertion, graceful-edge-deletion, abrupt-edge-deletion, graceful-node-deletion and 
node-unmuting. An extension of the analysis is provided in Subsection 14.21 for the case of an abrupt node 
deletion, and a slight modification is provided in Subsection l4.1l for the case of node-insertion. The following 
summarizes the guarantees of our implementation, and is proven in Lemmas l^fTOl and[T3l . 

Theorem 7. There is a complete fully dynamic distributed MIS algorithm which requires in expectation a 
single adjustment and 0(1) rounds for all topology changes. For edge insertions and deletions, graceful 
node deletion, and node unmuting, the algorithm requires 0(1) broadcasts, for an abrupt deletion of a node 
V* it requires O (min{log(n), d(u*)}) broadcasts, and for an insertion of a node v* it requires 0{d{v*)) 
broadcasts, in expectation. 
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In the algorithm a node may be in one of four states: M for an MIS node, M for a non-MIS node, C for 
a node that may need to change from M to M ox vice-versa, and R for a node that is ready to change. We 
will sometimes abuse notation and consider a state as the set of nodes which are in that state. Our goal is to 
maintain the MIS invariant. 

Algorithm 2 MIS Algorithm for node v 
1 : n G M: If some u G I-n-{v) changes to state C, change state to C. 

2: V £ M: If some u G Riv) changes to state C and all other w G Riv) are not in M, change state to C. 
3: V £ C: If (1) all neighbors u with Tr{v) < Tr{u) are not in state C and (2) v changed to state C at least 
2 rounds ago, change state to R. 

4: V £ R: If all u £ It^{v) are in states M or M, change state to M if all u £ It^{v) are in M, and change 
state to M otherwise. 


Any change of state of a node is followed by a broadcast of the new state to all of its neighbors. We 
now define our implementation as a sequence of state changes. When a topology change occurs at node v*, 
if the MIS invariant still holds then v* does not change its state and algorithm consists of doing nothing. 
Otherwise, v* changes its state to C. 

From states M or M, a node changes to state C when it discovers it is in the set S of influenced nodes, 
as defined in Equation ([Til. From state C, a node v changes to state R when (1) none of its neighbors u for 
which 7r(n) < tx{u) are in state C and (2) v changed its state to C at least two rounds ago. Finally, from 
state R a node v returns to states M ox M when all of its neighbors u for which ^{u) < 7r(v) are in states 
M or M. In order to bound the complexity of the algorithm we first show that every node can change from 
state R to either M or M at most once. 

Lemma 8. In Algorithm^ a node u changes its state from R to another state at most once. 

Proof. First, note that every u ^ S never changes its state. Consider a node u changing its state from R to 
either M or M. Since u changes from state R,ifuf^ v* then it must have a neighbor w £ It^{u) that was in 
state C, changed to state R and then changed to M or M. It follows that v* must be the first node to change 
its state from i? to M or M. This event occurs only when all neighbors u of v* are not in C, which in turn 
can happen only when all neighbors of each such u with higher vr value have changed from (7 to i? at least 
once. But, since no node could have changed its state from R to another state before v* has done so, we 
have that when v* changes its state from R to another, all n G 5 are in state R. 

In particular, we have that at the round of the first change of a node from R to another state, there are 
no nodes in state C. Since a node can only change to state C due to a neighbor at state C we have that any 
node changing its state from iZ to M or M will not change its state again, thus proving our claim. □ 

Lemma 9. For edge-insertion, graceful-edge-deletion, abrupt-edge-deletion, graceful-node-deletion and 
node-unmuting, Algorithm^re quires in expectation a single adjustment, 0(1) rounds, and 0(1) broadcasts. 

Proof. Since only nodes in S can change their outputs, the number of adjustments is bounded by I^I, and 
hence is 1 in expectation, by Theorem [T] According to Lemma [8j if a node changes its state then it does so 
exactly three times. First it changes from either M or M to O, then it changes to R, and finally it changes 
to either M ox M again. Since only nodes in S change their states and since the round and broadcast 
complexities are clearly bounded by the number of state changes plus 1 (due to the forced waiting round 
before changing from C to R), the claim follows. □ 
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4.1 Node and Edge Insertion 

When V* is inserted, or an edge {v* ,v**) is inserted, we assume that all new pairs of neighbors are notified 
that they are now connected, along with each other’s ID. In a setting where this is not the case, we make the 
following adjustment, before applying Algorithm [H 

When V* is inserted, in the first round, v* broadcasts its random i value and a temporary state M to its 
neighbors. In the second round, the neighbors of v* broadcast their states and their £ values. Now, it may be 
the case that the MIS invariant does not hold at v*, but it still holds for any other node in the graph. We now 
execute Algorithm |2l The expected number of adjustments hence remains 1, the number of rounds increases 
by two and is therefore still 0 ( 1 ), and the number of broadcasts is now bounded by the degree d{v*). 

When an edge {v*, v** ) is inserted, in the first round, v* and v** broadcast their random £ value and 
state. Now it may be the case that the MIS invariant no longer holds at v*, but it still holds for any other 
node in the graph. We then execute Algorithm|2l The expected number of adjustments hence remains 1, the 
number of rounds increases by one and is therefore still 0 ( 1 ), and the number of broadcasts is still bounded 
by 0 ( 1 ). 

We therefore get the following: 

Lemma 10. For a node-insertion of a node v*, our algorithm requires in expectation a single adjustment, 
0(1) rounds, and 0{d{v*)) broadcasts. For an edge-insertion of an edge {v*,v**), our algorithm requires 
in expectation a single adjustment, 0 ( 1 ) rounds, and 0 ( 1 ) broadcasts. 

4.2 Abrupt Node Deletion 

Consider a node v* that is abruptly deleted. We denote by nj, U 2 ,..., n* the set 5i = ^(Goid, Onewj tt, u*)i. 
We execute Algorithm |2l where in the first round, every v*, 1 < i < x changes its state to C (instead of 
having v* broadcast its change state to C). It is straightforward to verify that despite the above modification, 
only nodes in 5 = ^(Goid, Onew, n*) can change to state C throughout the execution. With this mod¬ 
ification, a node may change to state C more than once. However, we show the amount of times this can 
happen is bounded by both the degree of v* and by log(n). 

Lemma 11. The algorithm completes after at most 3151 +2 rounds. 

Proof. Consider a node v that changes to state C in round t. If f > 1, then there is a node u, for 
which 7r(M) < 7r{v), that changes to C in round t — 1. By induction, it follows that there exists a path 
(ui, U 2 ,..., vt), where vi = v* for some 1 < i < x and vt = v, and in addition 7r(nj) < 7r(uj+i) for all 
j < t. Since the latter implies that the nodes are distinct, we have that t < |5|, meaning that after round 
|5| no node changes to C. This gives that after an additional round, nodes begin to change to R. The same 
argument now gives that no node changes to R after round 2|5| + 1. After an additional round nodes start 
changing from R, and therefore, using the same argument again, we have that no node changed its state 
from R after round 3|5| + 2. To complete the proof, we argue that indeed this procedure progresses, hence 
eventually all nodes are in either M or M, with the MIS invariant holding. This holds since two consecutive 
rounds with no state changes implies that the algorithm terminated: After a round with no state change, if 
there are nodes in state C then the node with the maximal tt order among them changes to R. Otherwise, 
if there are nodes in state R then the node with the minimal vr order among them changes to either M or 
M. □ 

Lemma 12. Ifv changes from either M or M to C at rounds t, it does not change to neither M or M again 
before round 3t + 1. Further, each change of v to C can be associated with a different node vj, for some 
\ <i <x. 
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Proof. We prove the lemma by induction on t, where the base case for i = 1 trivially holds, as a node needs 
to make 3 changes. For t > 1, as in the previous lemma, there exists a path {vi,V 2 , ■ ■ ■ ,vt), where vi = v* 
for some 1 < i < x and vt = v, and in addition TT{vj) < 7 r(nj+i) for all j < t. 

Let tji > t + 2be. the first round in which v = vt changes to R. Notice that until that round, vt-i does 
not change to R, and the same holds inductively for vj for all 1 < j < t — 1. Let be the first round in 
which vi changes to R, and let ts be the first round in which vi changes to either M or M. It follows that 
t'ji > tn + t — 1 > 2t + 1, and hence ts >2t + 2. 

Now, let t'g be the first round in which v = vt changes to either M or M. In round t'g — 1 the node vt-i 
must be in either M or M, and inductively we have that at round t's — t + 1, the node vi is in either M or 
M. This implies that t's — t + 1 > ts > 2t + 2, giving t'^ > 3f + 1 as required. 

Further, when v changes its state from i? to M or M, the state of v* is already M or M. This implies that 
if V change its state to C again in time t' > t, then it is due to a change in v*,, for some i' i,l < i' < x. 
(Although it is possible that the path from v*, to v goes through v* again.) This means that each change of 
n to C can be associated with a different node Vi, for some I < i < x. □ 

Lemma 13. For an abrupt deletion of a node v*, our algorithm requires in expectation a single adjustment, 
0 (1) rounds, and 0(min{log(n), (i(n*)}) broadcasts. 

Proof. Now, the adjustment complexity is bounded by |5| and thus by Theorem [Jit is 1 in expectation. 
By Lemma dJ the round complexity is 0(|5|), and thus by Theorem [Jit is 0(1) in expectation. Finally, 
as a corollary from Lemma [T2| we get that the sequence of rounds in which a node v has changed to C 
is fi,f 2 ) ■ ■ ■ itr with ti > 1, fj+i > + 1 and by Lemma [TJ we have also O < 3|5| + 1. It follows 

that r < log 3 ( 0 (|S|)). Moreover, since by Lemma [12] every change to O by a node v can be accounted 
to a different node v*, we have that r < x < d{v*). This gives that the total number of broadcasts is at 
most 0(|5| min{log(|5|),d(n*)}). The claim immediately follows as since IE[|5|] < 1 (Theorem[J) and 
|5|<n. □ 

5 History Independence 

In this section, we define and discuss fhe history independence properfy. 

Motivation: In many well known problems, in addition to computing a feasible solution, it is sometimes 
required to compute an optimal solution with respect to a given objective function. For example, one may 
wish to find an MIS that has a maximal cardinality. Usually, obtaining optimal solutions, or even good 
approximate solutions, is NP-hard and therefore, we cannot expect to obtain such solutions in the distributed 
model. Furthermore, typically such optimization algorithms are tailored to a specific objective function, 
and hence it is required to handle each objective separately. Moreover, in a dynamic setting, it is more 
cumbersome to analyze the guarantees for the value of the objective function. 

Therefore, although we do not wish to consider any specific objective function in the problem descrip¬ 
tion, we do wish that the adversary will lack the ability to choose a feasible solution as she pleases (in this 
case we may assume she chooses the worst solution). In fact, we require that the adversary will even not be 
able to bias the output of the algorithm towards any specific solution. Formally, we define our requirement 
as follows. 

Definition 14. Let A be an algorithm for maintaining a combinatorial graph structure P in a dynamic 
distributed setting. We say that A is history independent if given a graph G, the output P of A is a random 
variable whose distribution depends only on G, and does not depend on the history of topology changes that 
constructed G. 
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Composability: Another advantage of history independent algorithms is that they compose nicely. For 
example, given a history independent algorithm A for MIS, we can simulate A on the line graph L(G) for 
obtaining a history independent algorithm B for maximal matching. Alternatively, we can simulate A on a 
graph G" = /(G) in which every node v mG corresponds to a clique of size A + 1 in G', and every edge 
in G corresponds to a matching between the appropriate cliques. This standard reduction by 1431 is known 
to give an algorithm G for (A + l)-coloring in G, and since A is history independent then so is G. We 
note that performing the above simulations is non-trivial, as the simple topological changes in G translate 
into more complex changes in L{G) or G', yet, the challenges are only technical and require no additional 
insights, and hence we omit the details. 

Examples: It is straightforward to see that Algorithm |2] is a history independent MIS algorithm, because 
its output for any graph G is identical to the output of the greedy sequential algorithm on G, regardless of 
the topology changes that resulted in G. To further exemplify what this important property gives, consider 
the following examples, in which we compare executions of history independent algorithms to worst case 
solutions. Although history dependent algorithms do not necessarily yield the worst solution, the natural 
history dependent algorithm indeed yields the worst solution in these examples. Here we think of the natural 
algorithm as the greedy algorithm that gives every new node or edge the best value that is possible without 
making any global changes. For this natural algorithm, one can easily verify that for any feasible output 
there is a pattern of topology changes that can force the algorithm to produce it. 

Example: MIS in a Star. Assume that an adversary controls the topology changes in the graph and 
chooses them so that a graph Ggtar is created, where Ggtar is a star on n nodes. Since our MIS algorithm 
simulates random greedy, there is a probability of 1/n that the center of the star has the lowest order among 
all nodes, in which case it is the only node in the MIS. With the remaining probability of 1 — 1/n, a different 
node has the lowest order, which results in the MIS being all nodes except the center, which is the largest 
MIS possible in Ggtar- The expected size of the resulting MIS is therefore linear in n, implying that it is 
within a constant factor of the size of the maximum independent set (maximal cardinality independent set). 
For comparison, recall that the worst-case MIS in a star is the center alone, and its size is 1. 

Example 2: Maximal matching of many 3-paths. Assume an adversary constructs a graph G^paths, 
which contains n/4 disjoint paths of length 3 edges. Since our maximal matching algorithm simulates a 
random greedy MIS algorithm on the line graph L{G^paths), we have that for every 3-path independently, 
with probability 2/3 its matching is of size 2 and with probability 1/3 its matching is of size 1. Therefore, 
the expected size of the matching we obtain is 5re/12. For comparison, notice that the worst-case maximal 
matching in G^paths has size n/4. 

Example 3: Coloring. Regarding coloring algorithms, there is much more room for improvement upon 
using the standard reduction with our MIS algorithm. As in our MIS algorithm, we would have liked to have 
an algorithm that simulates the sequential random greedy algorithm for coloring as well, since, for example, 
it would imply the following. 

Assume that an adversary controls the topology changes in the graph and chooses them so that a graph 
G = {V, E) is created, where G is a bipartite graph on the set of nodes V = L U i?, for L = {ui, - ■ ■, Un/ 2 } 
and R = {ui,..., u„/ 2 }- In E we have an edge between the nodes Ui and Vj for every i / j. Thus, G is a 
complete bipartite graph minus a perfect matching. If we run a random greedy coloring algorithm, then the 
first node, say Ui gets the color 1 when being inserted into the graph. If the next inserted node after Ui is 
Vj, for any j / i, then it gets the color 2, and afterwards every node in L gets color 1 and every node in R 
gets color 2. If the next inserted node after Ui is Uj, for any j / i, then it gets the color 1, and afterwards 
every node in L gets color 1 and every node in R gets color 2. That is, with probability 1 — 1/n, we get an 
optimal 2-coloring. With the remaining 1/n probability we might get a coloring as bad as linear in A. This 


14 


gives that in expectation, we get a coloring whose palette size is a constant factor away from optimal. 

We can, of course, simulate the random greedy sequential coloring, but the problem is that we pay a 
cost of 2^ adjustments. The way we can do this is to maintain that every color i is an MIS in the graph 
G — where Cj is the set of nodes of color j. However, a change to a node may cause each node 

in S to induce 2 sources of sets for the next color, which would result in a total set of 2^ adjustments. Notice 
that this is much worse compared to what a naive dynamic distributed coloring algorithm would give. It is a 
curious question whether we can indeed enjoy both worlds here, or whether any lower bound can be proved. 

The above examples illustrate how a property of the output of a history independent algorithm on a graph 
G can be analyzed as a simple combinatorial problem. This can lead to better guarantees compared to only 
being able to assume the worst case. 

6 Discussion 

This paper studies computing an MIS in a distributed dynamic setting. The strength of our analysis lies in 
obtaining that for an algorithm that simulates the sequential random greedy algorithm, the size of the set of 
nodes that need to change their output is in expectation 1. This brings the locality of the fundamental MIS 
problem to its most powerful setting. 

We believe our work sets the ground for much more research in this crucial setting. Below we discuss 
some open problems that arise from our work. 

An immediate open question is whether our analysis can be extended to cope with more than a single 
failure at a time. Second, there are many additional problems that can be addressed in the dynamic dis¬ 
tributed setting, especially in the synchronous case. We believe that our contribution can find applications 
in solving many additional dynamic distributed tasks. 

A major open question is whether our techniques can be adapted to sequential dynamic graph algorithms, 
which constitutes a major area of research in the sequential setting lll21ll31IT^l22l - l2^IT7]4301l33ll481l491l5n 
l52l . A formal definition and description of typical problems can be found in, e.g., |[T5l . Notice that 
our algorithms are fully dynamic, which means that they handle both insertions and deletions (of edges 
and nodes). Although our template for finding an MIS can be easily implemented in a sequential dynamic 
setting, it would come with a cost of at least 0(A) for the update complexity in a direct implementation. This 
is because we would have to access neighbors of the set of nodes analyzed in Theorem [T] Our distributed 
implementation avoids this by having them simply not respond since they do not need to change their output, 
and hence they do not contribute to the communication. Nevertheless, we believe that our approach may be 
useful for designing an MIS algorithm for the dynamic sequential setting, and leave this for future research. 
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